An auditory feature extraction method based on forward-masking and its application in robust speaker identification and speech recognition
Abstract
This article presents a new auditory feature extraction method that takes into account the forward-masking mechanism of auditory nerves and is feasible in practice. Two features based on this method are extracted: FMFRC (forward masking firing-rate cepstrum) and FMSRC (forward masking synchronized rate cepstrum). Isolated-word speech recognition and text-dependent speaker identification experiments are conducted on TI46. The experimental results show that the new auditory features perform comparably to MFCC in a clean environment but are far more noise-resistant than MFCC in both tasks.
1 This work is supported by National Nature Science Funds of China, the project number ...
Related resources
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
Speaker feature extraction from pitch information based on spectral subtraction for speaker identification
Robust speaker feature extraction under noise conditions is an important issue for application of a speaker recognition system. It is well known that LPC cepstrum, which expresses the spectral envelope, is effective for speaker recognition. This implies that the spectral rough structure is effective for speaker recognition. However, LPC cepstrum is a noise-sensitive feature. On the other hand, sp...
Robust Auditory-Based Speech Feature Extraction Using Independent Subspace Method
In recent years many approaches have been developed to address the problem of robust speaker recognition in adverse acoustical environments. In this paper we propose a robust auditory-based feature extraction method for speaker recognition according to the characteristics of the auditory periphery and cochlear nucleus. First, speech signals are represented based on frequency selectivity at basi...
Recognition of Persian accents from the speech signal using efficient feature extraction methods and classifier combination
Speech recognition has achieved great improvements recently. However, robustness is still one of the big problems: recognition performance fluctuates sharply depending on the speaker, especially when the speaker has a strong accent, and different accents dramatically decrease the accuracy of an ASR system. In this paper we apply three new methods of feature extraction including Spectral C...
Robust Text-independent Speaker Identification in a Time-varying Noisy Environment
Practical speaker recognition systems are often subject to noise or distortions within the input speech, which degrade performance. In this paper, we propose a new mel-frequency cepstral coefficients (MFCC) based speaker identification system with a Vector Quantization (VQ) modeling technique. It integrates a hearing masking effect based masker and a group of dozen triflers into traditional MFCC...
Publication date: 2000